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Abstract 

We present a method to compensate statistical errors in the calculation of correlations on asynchronous time series. 
The method is based on the assumption of an underlying time series. We set up a model and apply it to financial data 
to examine the decrease of calculated correlations towards smaller return intervals (Epps effect). We show that this 
statistical effect is a major cause of the Epps effect. Hence, we are able to quantify and to compensate it using only 
trading prices and trading times. 
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1. Introduction 

The decrease of calculated correlations in financial data towards smaller return (or "sampling"-) intervals has been 
of interest since Epps discovered this phenomenon in 1979 111]. Ever since, this behavior was found in data of different 
stock exchanges HSBEtl and foreign exchange markets jaBl- 

Many economists as well as physicists addressed this phenomenon, since a precise calculation of correlations 
is of major importance for the estimation of financial risk JUL H Eft. While the physicists' approach is often to 
construct a model which offers an explanation for this phenomenon, the standard economy approach is to work 
on estimators with the aim to suppress the Epps effect. Recently, Hayashi and Yoshida introduced a cumulative 



estimator 111 111 , only involving returns whose time intervals are overlapping. This estimator has been supplemented 



with different adjustments, such as bias compensation and lead-lag treatment @,[l2, 13]. A very similar approach 
on a completly different topic is the "discrete correlation function" in astrophysics which was introduced in 1988 
by Edelson and Krolik ffl . Other approaches to estimate correlation coefficients involve Previous-Tick-Estimators 
1 15, IjJ or realized kernel functions fl7Tl . 



An extensive study of microscopic causes leading to the Epps effect has been performed by Reno [18], while 
another work by Toth et. al. introduce a model for the Epps effect which is based on the phenomenon of lagged 
correlations B19I1 . 

However, certainly miscellaneous mechanisms are contributing to the Epps effect. Thus our approach is different. 
First, we will introduce a simple model which offers an explanation for the statistical part of the Epps effect, based 
on a central assumption of an underlying time series. Secondly, based on that model, we will present an estimator, 
with which these effects can be compensated. Finally, we will quantify the impact of this phenomenon on the Epps 
effect in recent empirical data and show that it can be a major cause for the Epps effect, especially when looking at 
less frequently traded securities. 

This paper is organized as followed: In section [2] we develop the model for correlations in asynchronous time 
series. Within the model, we observe a decay of correlations towards smaller return intervals, similar to the Epps 
effect. We then derive the method to compensate this phenomenon. In section [3] this method is applied to recent 
empirical data to estimate the impact of the observed effect on the Epps effect. We discuss the results in section|4] 



•Corresponding author. Tel.: +49 203 379 4727; Fax: +49 203 379 4732. 
Email address: michaelSmuennix.com (Michael C. Munnix) 



Preprint submitted to Physica A 



July 7, 2010 



— Stock 1 

— Stock 2 




Figure 1 : Illustration of the model for asynchronous trading times of two stocks. Shown above are the prices § on the underlying timescale. The 
"sampling" of theses prices 5 to prices 5 on simulated trading times are shown below. 



2. Statistical effects in asynchronous time series 

In section |2~T1 we set up our model and develop a compensation formalism for asynchrony effects in section l2~2l 

2.1. Model 

The central assumption of our model is the existence of an underlying non-lagged time series of prices. The 
assumption of a finer IU9ll or even continuous | IT , 2(J IS ] underlying timescale is a common approach in the estimation 
of correlations. This approach is also intuitive, as most stocks are traded at several stock exchanges simultaneously. 

To simulate asynchrony effects we generate an underlying correlated time series using the Capital Asset Pricing 
Model (CAPM) 12111 , which is also known as Noh's model 12211 in physics, 



r.d) 



r 



{t) = yfcrjit) + Vl -cs {i) {t) , (1) 



where stands for the relative price change, the so-called return of the z'-th stock and c is the correlation coefficient. 
The random variables 77 and are taken from a compound distribution as observed on market data by Gopikrishnan 
et. al. with power-law tails and a central Levy distribution (for details see Ref. I23I0 . We have chosen this approach 
to keep our model initially as simple as possible. We note, however, that return time series can also be autocorre- 



lated. While first order autocorrelations are in this context insignificantly small 12411 . second order autocorrelations or 
"volatility clustering" represent a strong characteristic of return time series and led to the development of autoregres- 
sive models, such as GARCH 125l 12611 . For this reason we also test our compensation in a more realistic setup against 
a GARCH(1,1) generated time series of underlying returns, given by 



f {i \t) = o®(i) (^n(t) + Vl -c£ (0 (f)) (2) 



with 



(cr®(0) =a + ai (r (i \t-l)) +fi 1 (o®(t-V>) . (3) 
The initial parameters of the GARCH process have been chosen as q-q = 2.4 • 10~ 4 , a\ = 0.15 and [5\ = 0.84. 
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Figure 2: Scaling behavior of the correlation coefficient on a simulated asynchronous time series. The length of the underlying time series was set 
to 7.2 10 6 (c = 0.4, //'' = 15 and /jP^ = 25). The correlation coefficient was calculated on return-intervals from 60 data points (corresponding to 1 
minute) to 1 800 data points (corresponding to 30 minutes). 



Two return time series and r (2) are generated representing two correlated stocks. The lengths of these underly- 
ing time series are chosen as 7.2 ■ 10 6 , 1.44 • 10 6 and 7.2 ■ 10 5 corresponding to a return interval At on the underlying 
timescale of 1, 5 and 10 seconds during 1 trading year. 

Using these returns and an arbitrary starting price, the underlying price series S (l) and 5 (2) are calculated implying 
a geometric Brownian motion with zero drift and a standard deviation of 10~ 3 per time step. To model the asyn- 
chronous trade processes, these prices are sampled independently using exponentially distributed waiting times with 
average values typical for the stock market (see Fig. Q]). In the following example, we choose the average waiting 
times as = 15 and // 2) = 25 (equivalent to seconds in this example), while the underlying time series were cor- 
related with c = 0.4. On the resulting "macroscopic" time series, the return between two points in time (of the i-th 
stock) can be calculated as 



( , _ S'Ht + At)-S^(t) 
A ' W S»(t) 



(4) 



where S w (f) denotes the price at time t and At is the return interval. Between these return time series, we now calculate 
the correlation coefficient, 



, >\_/ r «\/ r C/>\ 

/ (i) (;'k \ At At / \ At/\ At ,- s 

corr(r« r«) = ^ ( , -) ' (5) 

At At 

where (. . .) denotes the mean value of a time series with length T and where cr refers to the standard deviation of the 
same time series. We note that we refer to the whole time series of returns r ( £ f when the argument (f) is omitted. 

When calculating the correlation of returns of the sampled time series S using different return intervals At, the 
correlation coefficient scales down as shown in Fig [2] This behavior is very similar to the Epps effect in empirical 
data. It occurs only because of the asynchrony of the trading times. As this behavior is already observed in this simple 
setting, we are able to derive a method to compensate it, as the following demonstrates. 

2.2. Compensation 

The basic idea of this approach is the following: Due to the asynchrony, each term of the correlation coefficient 
can be divided into a part which contributes to the correlation and a part which is uncorrected and therefore lowers 
the correlation coefficient. 

According to the model assumption, the price change during At is based on price changes on an underlying 
"microscopic" timescale. Thus, the return can also be expressed as a sum of the underlying returns, 



3 



<(') 

r {i \t) = £ r®(y®(*) + jAt) . (6) 

7=0 

Here r w (f,) is the return related to S{t) on the underlying time scale of non-overlapping intervals At (e.g. 1 second) 
given by 

rnit + mJ« + u+1 ™- S « + J **. (7) 

The quantity y w (f) in equation (O represents the time of the last trade of the 2-th stock at time t, 

r (0 (0 = max(/®j| (8) 

'trade-' 

When calculating the return for the interval [t, t + At] of two stocks, the actual price at t and t + At is generally in 
the past, more precisely at y m {t), y (2) (f) and y (1) (f + At), y {2 \t + At). These trading times are distinct for each stock, 
therefore only a fraction of the underlying prices processed by the return is correlated. The number of terms Nv. of 
the sum in equation (O is given by 

(y (i) (f + Af) - y (i) (f)) 



We normalize the returns to zero mean and unit variance and indicate them as g and g : 



(0 ' * ^ =-© 

Al 

In this context, the relation of the returns on both time scales in equation (|6]l changes to 



*2® = Vl I sVco + m - ( >u w Af()j . (id 



as worked out in appendix Appendix A When using normalized returns, the correlation coefficient of two return time 
series r j and r^J (see equation (f5]l) simplifies to 

1 T 

corr(rM) = corr(^,^) = -^g^g^tj) . (12) 

j=o 

As the mean value over T of the second term from equation (fTTT i is equal to zero, we obtain in terms of the underlying 
time series 



corr(r« r< 2) ) = 1 £ J g'Wj) + kAt) £ |< 2 >(y (2 ^) + W 



T 

j=0 \ k=0 1=0 



(13) 



As illustrated in Fig. [3] only a subset of the underlying prices S of two prices S share an overlapping time-interval. 
Because of this "overlap" only a certain amount N&,(t) of the underlying returns is correlated, namely 

**G) = ^ (14) 

with Af (f) being the time interval of the actual overlap, 

At (t) = min(y (1) (f + Af),y (2) (f + Af)) - max(y (1) (f),7 (2) (0) . (15) 
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Figure 3: Illustration of the overlap A; . 



Each sum can be split up into iVYi ~ N terms that are uncorrelated and N that are correlated. Thus, equation ( fT3b can 
be written as: 
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(16) 



where only the sums of synchronous returns are correlated among each other. In this notation, the underlying time 
series is indexed as [f (l \to), f^\t\), . . . r ( '\t N m))], where the returns from to to tf) are corresponding to the overlap. 
When expanding the product, the non-correlated returns converge to zero due to the outer average 
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Figure 4: Distribution of the overlaps for simulated return intervals At = 150 (a), 450 (b) and 1500 (c) data points (corresponding to 2.5, 7.5 and 
25 minutes). Towards larger return-intervals, the distribution sharpens, as well as its mean converges to 1 (d). 



where coir, represents the correlation of the underlying returns corresponding to the interval [f, t + Af]. 

Af D (f)/Af is the fractional overlap of the corresponding return interval. The fractional overlap does not depend 
on the actual timescale of the underlying time series. As equation (fTTT i clearly shows, the correlation coefficient 
of the synchronous part of the return time series is multiplied by the fractional overlap. Hence, this effect can be 
compensated by 

T 

corr con - ected (^,^) = I^g^^^JL ■ (18) 

The dashed line in Fig. |5]represents the asynchrony-compensated correlation within our simulation. It turns out that 
there is a remaining effect that still causes a downscaling of the correlation coefficient for very small return intervals. 
This behavior occurs when the price of either of the stocks did not change during the return-interval and therefore 
the corresponding return equals zero. Of course, this event becomes more probable on smaller return intervals Af. It 
corresponds to the small peak at Af Q = in Fig . |4(a)| This remaining downscaling coincides with the cumulative 



estimator described by Hayashi and Yoshida [11]. It can also be expressed in the formalism used here. It reads 



corrfr ' r') (19) 

A/ ' Af |( r (')(,) #r (i)(, + A,)) A ( r (2) ( , )#r (2) ( , +A , )) ' 



Therefore, when combining both estimators, and thus only regarding returns with overlapping time intervals, the 
remaining scaling behavior for very small returns can be compensated as well. 

As displayed in Fig. |4] the overlap can also be larger than the actual return interval, implying that terms with 
such overlaps are corrected downwards. Therefore the compensation can amplify a specific term of the correlation 
coefficient as well as it can attenuate it. 
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Figure 5 : Compensation of asynchrony effects within the model with different approaches in the generation of the underlying time series. The 
dashed line represents the correlation coefficient, which is corrected by the overlap. The solid line regards in addition only returns, in which time 
intervals trades occurred. 



3. Application to market data 

Certainly, many aspects contribute to the Epps effect. Our present aim is to quantify the part, which is caused by 
the asynchrony of the time series. 

It is difficult to isolate the Epps Effect on single stock pairs, as it can superimpose with other effects leading to 
other characteristics of the correlation coefficient than expected according to the Epps effect. A common approach 
on this topic is to pick the pairs of stocks, which show a distinctive Epps effect and focus the analysis to these pairs 
S In the following, we would like to take a different approach: 

We classify two ensembles of stock pairs. After compensating the asynchrony effect for each pair, we build the 
average for the ensemble. We also plot the error bars representing the double standard deviation 2<x. By this method, 
we can show the scope of the asynchrony model and identify regions, in which other effects dominate. All data was 



extracted from the NYSE's TAQ database for the year 2007 (27 ]. 

The first ensemble consists of stock pairs which provide the most stable correlation. Thereby we want to suppress 
those effects which are caused by a change in the correlation during the period in which the correlation coefficient is 
calculated. This ensemble represents ideal test conditions for the asynchrony compensation. To identify those stock 
pairs with a stable correlation, we calculate the correlation coefficient of 30 daily returns. After shifting this window 
in 1-day intervals through the year, we calculate the variance of the obtained correlation coefficients (var corl ). Then we 
identify the five stocks providing the smallest variance for each Global Industry Classification System (GICS) branch 



of the Standard & Poor's (S&P) 500 index. This results in an ensemble of 50 stocks as shown in table IB. ll appendix 



Appendix B 



As the correlation structure of stocks can be non-stationary, we also evaluate the asynchrony compensation without 
the restriction to stable correlations. For this purpose, we select a second ensemble consisting of 5 stock pairs of each 
GICS branch of the S&P 500 index, whose daily returns are providing the strongest correlation during the year 



2007. These stocks include highly non-stationary correlations as indicated in table IB. 2| appendix Appendix B (row 
"var corr "). 

Fig. [6] shows the ensemble average of the correlation coefficient and the asynchrony-compensated correlation 
coefficient for both ensembles in 2007 (250 trading days). Before averaging, the correlation coefficients for each 
stock have been normalized to the value at a return interval Af = 40 minutes. 

When looking at the whole ensemble we discover that the asynchrony has a pronounced impact on the Epps effect. 
The asynchrony effect seems to be the dominating cause for the Epps effect on return intervals down to approximately 
10 minutes, where the remaining Epps effect is on average less than 3% of the correlation coefficient's saturation value 
at large return intervals. For smaller return intervals, other effects dominate, e.g. a lag between the time series of two 
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Figure 6: Asynchrony-compensated correlations of two ensembles. The data has been normalized to its value at 40 minutes. The error bars 
represent the double standard deviation. 
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(a) D - XEL (b) AMGN - GENZ 

Figure 7: Asynchrony-compensated correlations between Dominion Resources, Inc. (D) - Xcel Energy Inc. (XEL) and Amgen Inc. (AMGN) - 
Genzyme Corp. (GENZ) 



stocks, as recent study indicates lfl9ll . 

However, the ensemble consists also of stocks which are very frequently traded, providing a very short average 
waiting time which results in a fractional overlap A /o (f)/Af close to unity. Evidently the presented compensation only 
has a small impact on the correlation estimation of these stocks, as they are so frequently traded that their time series 
can almost be described as continuous. Naturally the presented compensation works best for less frequently traded 
stocks, as they actually show an asynchronous behavior. Fig. Qprovides two examples of stock pairs for which the 
asynchrony of time series is a major effect. While Fig. |7(a)| shows a "clean" Epps effect, Fig. |7(b)| shows an Epps 
effect which is superimposed with other phenomena. 

Of course, within the statistical ensemble stock pairs can be found that either do not show an Epps effect or that 
are so infrequently traded that the assumption of an underlying timeline seems to be unreasonable. Even though the 
assumption of an underlying time series is a common and intuitive approach on this topic, it may not be valid for very 
infrequently traded stocks. 

When looking at single stock pairs, it turns out that the asynchrony-compensation works well, if a distinguished 
Epps effect is found. In case of adopting the presented method as a black box model without looking at the scaling 



behavior of the correlation coefficient, we believe that a return interval of 5 minutes represents a good lower bound 
for the scope of this method. 



4. Conclusion 

We presented a model for the scaling behavior of financial correlations due to the asynchrony of the time series. 
This purely statistical effect can be compensated. Furthermore, we applied this compensation to market data under the 
assumption of an underlying time series with non-lagged correlations. We quantified the influence of the asynchrony 
on the overall decay of the correlation coefficient towards small return intervals, which is known as the Epps effect. 
The results clearly demonstrate that the asynchrony can have a huge impact on the Epps effect. It rather can be 
the dominating cause for less frequently traded stocks. The main advantage of our method is that no parameters or 
adjustments are necessary, since it is based on the trading times only. 

In our empirical study, the asynchrony-compensation allowed us to recover the correlation coefficient for return 
intervals down to 10 minutes. At this return interval, the remaining Epps effect is on average less than 3% of the 
correlation coefficient's saturation value at large return intervals. We also demonstrated that the presented method 
holds for non-stationary correlated time series. The accurate calculation of correlations is of major importance for 
risk management. To keep the estimation error small, a long time series of returns is required. Yet at the same time, 
the time series should not reach too far into the past. The latter is important because the correlation structure can 
be highly dynamic, as the dramatic events of autumn 2008 prove. Applying our method to intraday data allows to 
choose smaller return intervals and hence provides improved statistical significance of the correlations for the same 
time horizon. 
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We defined the normalzed returns as 



Appendix A. Relation between g® and g U) 



J « (0 = r fi£ w^> (A1) 



VVar(r«) 

where Var(- • • ) refers to the variance of a time series. Inserting the return, expressed through the underlying time 
series, 

<>(') 

r%) = J] ^(7%) + m , (A.3) 
;=o 

in equation ( 1A.U , results in 



(0 = 



Z (r®(r (0 W+;'Ai))-(r®) 

7=0 



.<(0 



VVar(rC')) £ (W W + m) ~ <^> + N%(f)(f®) 

7=0 



(A.4) 



(A.5) 



In equation ( IA.5I ). ( 1A.2I I was used to express the underlying returns f ( 'K 
The mean values and variance are additive, which leads to 

Therefore, we obtain 

, <r®> f/jV (i) (f)\ - N & (t)) 

g > = — J= 2 j» ( yo(o + y'A?) - {r . (A . 7) 

As the average time interval per return converges to Af, the mean number of underlying price changes (.N®(t)\ is given 
by At /At. Thus, we arrive at 



= V at Z ^ (0 « + - H ^Jr )j , (a.8) 



which is equation (fTTT i. 
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Appendix B. Stock ensembles 



Table B.l: Top 5 five stock pairs with the most stable correlation from each GICS branch of the S&P 500 index. 
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AMZN 


Amazon Corp. 


NASDAQ 


1 2 1 5700 


SBUX 


Starbucks Corp. 


NASDAQ 


1287200 


0.80 


0.45 




APOL 


Apollo Group 


NASDAQ 


332200 


SHLD 


Sears Holdings Corporation 


NASDAQ 


317900 


0.28 


0.57 


Consumer Discretionary 


AMZN 


Amazon Corp. 


NASDAQ 


1 2 1 5700 


SPLS 


Staples Inc. 


NASDAQ 


674500 


0.80 


0.62 




AMZN 




NASDAQ 








NASDAQ 


910000 








EXPE 


Expedia Inc. 


NASDAQ 


883300 


SHLD 


Sears Holdings Corporation 


NASDAQ 


317900 


0.43 


0.66 




COST 


Costco Co. 


NASDAQ 


237500 


PG 


Procter & Gamble 


NYSE 


1396077700 


0.05 


0.71 




COST 


Costco Co. 


NASDAQ 


237500 


CVS 


CVS Caremark Corp. 


NYSE 


1454768200 


0.09 


0.77 


Consumer Staples 


KO 


Coca Cola Co. 


NYSE 


1152559200 


COST 


Costco Co. 


NASDAQ 


237500 


0.04 


0.79 




MO 


Altria Group Inc. 


NYSE 


1312801100 


CCE 


Coca-Cola Enterprises 


NYSE 


335449400 


0.29 


0.79 




CCE 


Coca-Cola Enterprises 


NYSE 


335449400 


KFT 


Kraft Foods Inc -A 


NYSE 


1438368700 


0.32 


0.80 




EP 


El Paso Corp. 


NYSE 


697103700 


SE 


Spectra Energy Corp. 


NYSE 


331523100 


0.34 


0.84 




cvx 


Chevron Corp. 


NYSE 


1271849400 


SE 


Spectra Energy Corp. 


NYSE 


331523100 


0.26 


0.85 


Energy 


HES 


Hess Corporation 


NYSE 


427802700 


SE 


Spectra Energy Corp. 


NYSE 


331523100 


0.23 


0.85 




MUR 


Murphy Oil 


NYSE 


223916300 


SE 


Spectra Energy Corp. 


NYSE 


331523100 


0.25 


0.86 




SII 


Smith International 


NYSE 


370893400 


SE 


Spectra Energy Corp. 


NYSE 


331523100 


0.32 


0.86 




SCHW 


Charles Schwab 


NASDAQ 


1445500 


ETFC 


ETrade Financial Coip. 


NASDAQ 


1391800 


0.67 


0.44 




ETFC 


E* Trade Financial Corp. 


NASDAQ 


1391800 


FITB 


Fifth Third Bancorp 


NASDAQ 


218000 


0.29 


0.58 


Financials 


SCHW 


Charles Schwab 


NASDAQ 


1445500 


FITB 


Fifth Third Bancorp 


NASDAQ 


218000 


0.36 


0.58 




SCHW 


Charles Schwab 


NASDAQ 


1445500 


HCBK 


Hudson City Bancorp 


NASDAQ 


686900 


0.54 


0.60 




ACAS 


American Capital Strategies Ltd 


NASDAQ 


207200 


SCHW 


Charles Schwab 


NASDAQ 


1445500 


0.50 


0.60 




CELG 


Celgene Corp. 


NASDAQ 


619200 


ESRX 


Express Scripts 


NASDAQ 


998400 


0.65 


0.47 




AMGN 


Amgen 


NASDAQ 


813900 


CELG 


Celgene Corp. 


NASDAQ 


619200 


0.48 


0.50 


Health Care 


AMGN 


Amgcn 


NASDAQ 


813900 


BIIB 


BIOGEN IDEC Inc. 


NASDAQ 


381800 


0.48 


0.52 




CELG 


Celgene Corp. 


NASDAQ 


619200 


THC 


Tenet Healthcare Corp. 


NYSE 


805228900 


-0.23 


0.53 




AMGN 


Amgen 


NASDAQ 


813900 


GENZ 


Genzyme Corp. 


NASDAQ 


242900 


0.57 


0.53 




GE 


General Electric 


NYSE 


4303823300 


LUV 


Southwest Airlines 


NYSE 


862775700 


0.53 


0.73 




MMM 


3M Company 


NYSE 


549124400 


CBE 


Cooper Industries Ltd. 


NYSE 


175911300 


0.24 


0.73 


Industrials 


CBE 


Cooper Industries Ltd. 


NYSE 


175911300 


GWW 


Grainger (WW.) Inc. 


NYSE 


95324400 


0.15 


0.73 




CBE 


Cooper Industries Ltd. 


NYSE 


175911300 


GR 


Goodrich Corporation 


NYSE 


144177800 


0.15 


0.75 




CBE 


Cooper Industries Ltd. 


NYSE 


175911300 


FLR 


Fluor Coip. (New] 


NYSE 


171713100 


0.10 


0.76 




AAPL 


Apple Inc. 


NASDAQ 


4627500 


INTC 


Intel Corp. 


NASDAQ 


5529100 


0.84 


0.19 




AAPL 


Apple Inc. 


NASDAQ 


4627500 


CSCO 


Cisco Systems 


NASDAQ 


4886800 


0.76 


0.24 


Information Technology 


AAPL 


Apple Inc. 


NASDAQ 


4627500 


YHOO 


Yahoo Inc. 


NASDAQ 


2609700 


0.71 


0.25 




AAPL 


Apple Inc. 


NASDAQ 


4627500 


ORCL 


Oracle Coip. 


NASDAQ 


1731900 


0.78 


0.25 




AAPL 


Apple Inc. 


NASDAQ 


4627500 


EBAY 


eBay Inc. 


NASDAQ 


1056100 


0.73 


0.26 




MON 


Monsanto Co. 


NYSE 


479605700 


SEE 


Sealed Air Corp. (New) 


NYSE 


126716200 


0.06 


0.52 




FCX 


Freeporl-McMoran Cp & Gld 


NYSE 


1058215000 


SIAL 


Sigma- Aldrich 


NASDAQ 


133800 


0.05 


0.54 


Materials 


ECL 


Ecolab Inc. 


NYSE 


163404500 


SEE 


Sealed Air Corp. (New) 


NYSE 


126716200 


0.26 


0.60 




ATI 


Allegheny Technologies Inc 


NYSE 


269746100 


SEE 


Sealed Air Corp. (New) 


NYSE 


126716200 


0.10 


0.63 




PX 


Praxair Inc. 


NYSE 


245761900 


SEE 


Sealed Air Corp. (New) 


NYSE 


126716200 


0.12 


0.65 




Q 


Qwest Communications Inl 


NYSE 


1623807700 


S 


Sprint Nexlel Corp. 


NYSE 


2044634000 


0.52 


0.84 




Q 


Qwest Communications Inl 


NYSE 


1623807700 


VZ 


Verizon Communications 


NYSE 


1472335800 


0.49 


0.86 


Telecommunication Services 


S 


Sprint Nexlel Corp. 


NYSE 


2044634000 


VZ 


Verizon Communications 


NYSE 


1472335800 


0.49 


0.87 




AMT 


American Tower Corp. 


NYSE 


387199400 


Q 


Qwest Communications Inl 


NYSE 


1623807700 


0.26 


0.97 




AMT 


American Tower Corp. 


NYSE 


387199400 


WIN 


Wind st ream Coip oral ion 


NYSE 


400634200 


0.10 


0.99 




DUK 


Duke Energy 


NYSE 


902519200 


DYN 


Dynegy Inc. 


NYSE 


702035600 


0.54 


0.74 




CMS 


CMS Energy 


NYSE 


264225200 


DYN 


Dynegy Inc. 


NYSE 


702035600 


0.48 


0.79 


Utilities 


CMS 


CMS Energy 


NYSE 


264225200 


DUK 


Duke Energy 


NYSE 


902519200 


0.39 


0.81 




AES 


AES Corp. 


NYSE 


556049300 


CMS 


CMS Energy 


NYSE 


264225200 


0.31 


0.84 




CNP 


CenterPoinl Energy 


NYSE 


359757800 


DUK 


Duke Energy 


NYSE 


902519200 


0.33 


0.84 
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Table B.2: Top 5 five stock pairs with the highest correlation from each GICS branch of the S&P 500 index. 



G1CS Branch 




Slock 1 








Stock 2 






corr 


varcorr 




Symbol 


Name 


Slock Exchange 


Volume 


Symbol 


Name 


Slock Exchange 


Volume 








APOL 


Apollo Group 


NASDAQ 


332200 


SPLS 


Staples Inc. 


NASDAQ 


674500 


0.77 


1.08 




BBBY 


Bed Bath & Beyond 


NASDAQ 


233000 


SPLS 


Staples Inc. 


NASDAQ 


674500 


0.79 


1.08 


Consumer Discretionary 


AMZN 


Amazon Corp. 


NASDAQ 


1215700 


SPLS 


Staples Inc. 


NASDAQ 


674500 


0.80 


0.62 




AMZN 


Amazon Corp. 


NASDAQ 


1215700 


SBUX 


Starbucks Corp. 


NASDAQ 


1287200 


0.80 


0.45 




SPLS 


Staples Inc. 


NASDAQ 


674500 


SBUX 


Starbucks Corp. 


NASDAQ 


1287200 


0.81 


0.69 




KO 


Coca Cola Co. 


NYSE 


1152559200 


SLE 


Sara Lee Corp. 


NYSE 


542934700 


0.42 


0.90 




SLE 


Sara Lee Corp. 


NYSE 


542934700 


WMT 


Wal-Mart Stores 


NYSE 


1992433400 


0.43 


1.02 


Consumer Staples 


CVS 


CVS Caremark Corp. 


NYSE 


1454768200 


KFT 


Kraft Foods Inc-A 


NYSE 


1438368700 


0.44 


0.98 




KFT 


Kraft Foods Inc-A 


NYSE 


1438368700 


SLE 


Sara Lee Corp. 


NYSE 


542934700 


0.48 


0.87 




COST 


Costco Co. 


NASDAQ 


237500 


WFMI 


Whole Foods Market 


NASDAQ 


211400 


0.59 


1.09 




EP 


El Paso Corp. 


NYSE 


697103700 


RDC 


Rowan Cos. 


NYSE 


369346200 


0.39 


0.91 




cvx 


Chevron Corp. 


NYSE 


1271849400 


XOM 


Exxon Mobil Corp. 


NYSE 


2798325600 


0.41 


1.00 


Energy 


XOM 


Exxon Mobil Corp. 


NYSE 


2798325600 


HAL 


Halliburton Co. 


NYSE 


1701703200 


0.45 


1.18 




CVX 


Chevron Corp. 


NYSE 


1271849400 


EP 


El Paso Corp. 


NYSE 


697103700 


0.45 


0.88 




COP 


ConocoPhillips 


NYSE 


1382115600 


EP 


El Paso Corp. 


NYSE 


697103700 


0.46 


0.94 




CINF 


Cincinnati Financial 


NASDAQ 


55900 


TROW 


T. Rowe Price Group 


NASDAQ 


216400 


0.60 


1.74 




FITB 


Fifth Third Bancorp 


NASDAQ 


218000 


HBAN 


Huntington Bancshares 


NASDAQ 


71600 


0.62 


2.01 


Financials 


FITB 


Fifth Third Bancorp 


NASDAQ 


218000 


TROW 


T. Rowe Price Group 


NASDAQ 


216400 


0.63 


1.34 




SCHW 


Charles Schwab 


NASDAQ 


1445500 


ETFC 


E*Trade Financial Corp. 


NASDAQ 


1391800 


0.67 


0.44 




TROW 


T. Rowe Price Group 


NASDAQ 


216400 


ZION 


Zions Bancorp 


NASDAQ 


48500 


0.70 


1.40 




BI1B 


BIOGEN IDEC Inc. 


NASDAQ 


381 S00 


PDCO 


Patterson Cos. Inc. 


NASDAQ 


106400 


0.60 


1.15 




BIIB 


BIOGEN IDEC Inc. 


NASDAQ 


381800 


GENZ 


Genzyme Corp. 


NASDAQ 


242900 


0.62 


0.85 


Health Care 


GENZ 


Genzyme Corp. 


NASDAQ 


242900 


GILD 


Gilead Sciences 


NASDAQ 


275100 


0.64 


0.91 




CELG 


Celgcne Corp. 


NASDAQ 


619200 


ESRX 


Express Scripts 


NASDAQ 


998400 


0.65 


0.47 




BSX 


Boston Scientific 


NYSE 


1205569800 


THC 


Tenet Healthcare Corp. 


NYSE 


805228900 


0.68 


0.60 




GE 


General Electric 


NYSE 


4303823300 


LUV 


Southwest Airlines 


NYSE 


862775700 


0.53 


0.73 




CTAS 


Cintas Corporation 


NASDAQ 


48500 


EXPD 


Expeditors Int'l 


NASDAQ 


215600 


0.54 


1.48 


Industrials 


EXPD 


Exp editors IntT 


NASDAQ 


215600 


MNST 


Monster Worldwide 


NASDAQ 


196700 


0.55 


1.20 




CHRW 


C.H. Robinson Worldwide 


NASDAQ 


112400 


EXPD 


Expeditors Int'l 


NASDAQ 


215600 


0.56 


1.46 




MNST 


Monster Worldwide 


NASDAQ 


196700 


PCAR 


PACCAR Inc. 


NASDAQ 


164400 


0.56 


1.23 




DELL 


Dell Inc. 


NASDAQ 


909400 


ORCL 


Oracle Corp. 


NASDAQ 


1731900 


0.79 


0.47 




CSCO 


Cisco Systems 


NASDAQ 


4886800 


DELL 


Dell Inc. 


NASDAQ 


909400 


0.79 


0.41 


Information Technology 


INTC 


Intel Corp. 


NASDAQ 


5529100 


ORCL 


Oracle Corp. 


NASDAQ 


1731900 


0.82 


0.29 




CSCO 


Cisco Systems 


NASDAQ 


4886800 


ORCL 


Oracle Corp. 


NASDAQ 


1731900 


0.83 


0.34 




AAPL 


Apple Inc. 


NASDAQ 


4627500 


INTC 


Intel Corp. 


NASDAQ 


5529100 


0.84 


0.19 




DD 


Du Pont (EX) 


NYSE 


716944300 


FCX 


Frceporl-McMoran Cp & Gld 


NYSE 


1058215000 


0.36 


1.02 




FCX 


Frceport-McMoran Cp & Gld 


NYSE 


1058215000 


MON 


Monsanto Co. 


NYSE 


479605700 


0.36 


0.85 


Materials 


APD 


Air Products & Chemicals 


NYSE 


187953500 


BLL 


Ball Corp. 


NYSE 


119560600 


0.36 


0.88 




ECL 


Eeolab Inc. 


NYSE 


163404500 


NEM 


Newmont Mining Corp. (Hldg. Co.) 


NYSE 


958900000 


0.37 


0.93 




FCX 


Freeport-McMoran Cp & Gld 


NYSE 


1058215000 


NEM 


Newmonl Mining Corp. (Hldg. Co.) 


NYSE 


958900000 


0.43 


1.06 




T 


AT&T Inc. 


NYSE 


2663617200 


Q 


Qwest Communications Inl 


NYSE 


1623807700 


0.45 


1.01 




S 


Sprint Nexlel Corp. 


NYSE 


2044634000 


vz 


Verizon Communications 


NYSE 


1472335800 


0.49 


0.87 


Telecommunication Services 


Q 


Qwest Communications Inl 


NYSE 


1623807700 


vz 


Verizon Communications 


NYSE 


1472335800 


0.49 


0.86 




T 


AT&T Inc. 


NYSE 


2663617200 


vz 


Verizon Communications 


NYSE 


1472335800 


0.50 


1.05 




Q 


Qwest Communications Inl 


NYSE 


1623807700 


s 


Sprint Nexlel Corp. 


NYSE 


2044634000 


0.52 


0.84 




DUK 


Duke Energy 


NYSE 


902519200 


TE 


TECO Energy 


NYSE 


177983100 


0.35 


0.87 




D 


Dominion Resources 


NYSE 


321656100 


XEL 


Xcel Energy Inc 


NYSE 


337262500 


0.36 


0.92 


Utilities 


CMS 


CMS Energy 


NYSE 


264225200 


DUK 


Duke Energy 


NYSE 


902519200 


0.39 


0.81 




CMS 


CMS Energy 


NYSE 


264225200 


DYN 


Dynegy Inc. 


NYSE 


702035600 


0.48 


0.79 




DUK 


Duke Energy 


NYSE 


902519200 


DYN 


Dynegy Inc. 


NYSE 


702035600 


0.54 


0.74 
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